Learning extended FSA from software: An empirical assessment

نویسندگان

  • David Lo
  • Leonardo Mariani
  • Mauro Santoro
چکیده

A number of techniques that infer finite state automata from execution traces have been used to support test and analysis activities. Some of these techniques can produce automata that integrate information about the data-flow, that is, they also represent how data values affect the operations executed by programs. The integration of information about operation sequences and data values into a unique model is indeed conceptually useful to accurately represent the behavior of a program. However, it is still unclear whether handling heterogeneous types of information, such as operation sequences and data values, necessarily produces higher quality models or not. ehavioral models In this paper, we present an empirical comparative study between techniques that infer simple automata and techniques that infer automata extended with information about data-flow. We investigate the effectiveness of these techniques when applied to traces with different levels of sparseness, produced by different software systems. To the best of our knowledge this is the first work that quantifies both the effect of adding data-flow information within automata and the effectiveness of the techniques when varying sparseness of traces. quantified constraints, compared to simple FSAs. However it is . Introduction In the recent years, several techniques that automatically derive ehavioral models from execution traces have been used to suport validation and verification activities (Lorenzoli et al., 2008; allmeier et al., 2006; Ernst et al., 2001; Hangal and Lam, 2002; enkel and Diwan, 2003; Mariani and Pastore, 2008; Raz et al., 002; Ammons et al., 2002). A kind of model that is both comonly used to represent program behaviors and largely supported y model generation techniques is Finite State Automaton (FSA) Lorenzoli et al., 2008; Dallmeier et al., 2006; Mariani and Pastore, 008; Ammons et al., 2002; Biermann and Feldman, 1972). Recent results highlighted that many program behaviors canot be accurately represented with FSAs, but require models that nclude not only the information about the possible sequences f events executed by programs but also data-flow information Lorenzoli et al., 2008; Mariani and Pastore, 2008; Mariani et al., 011a). For instance, the parameters that a program uses to open file can influence the way the file will be successively used (e.g., pening a file with the read option does not allow values to be writen into the file). Simple FSAs can represent the different ways a file ∗ Corresponding author. E-mail address: [email protected] (L. Mariani). 164-1212/$ – see front matter © 2012 Elsevier Inc. All rights reserved. ttp://dx.doi.org/10.1016/j.jss.2012.04.001 © 2012 Elsevier Inc. All rights reserved. can be used (e.g., reading or writing), but cannot relate the specific usage to the opening mode. For instance gkTail (Lorenzoli et al., 2008) and KLFA (Mariani and Pastore, 2008) are two techniques that can extract FSAs that incorporate data-flow information. In particular, gkTail builds FSAs where transitions are annotated with data-flow information consisting of algebraic constraints. The algebraic constraints associated with transitions specify the concrete values that can be assigned to attributes, thus representing with a single transition a large (possibly infinite) set of events. Fig. 1 shows an example EFSA1 inferred by gkTail. KLFA builds simple FSAs with transition labels that incorporate data-flow information consisting of universally quantified constraints. Universally quantified constraints specify how data values can reoccur across events. Also in this case the inferred FSA can represent a (possibly infinite) set of events with a single transition. Fig. 2 shows an example EFSA inferred by KLFA. The models generated by KLFA and gkTail can effectively represent data-flow information through algebraic and universally still unclear whether automatically inferred models extended with data-flow information are generally more accurate than simple 1 In the rest of the paper we will interchangeably use the terms extended FSA (EFSA) and FSA with data-flow information to indicate models extended with either algebraic or universally quantified constraints. 2064 D. Lo et al. / The Journal of Systems and

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Perceived need and actual usage of the family support agreement in rural China: results from a nationally representative survey.

PURPOSE The Family Support Agreement (FSA) is a voluntary but legal contract between older parents and adult children on parental support in China. As the first comprehensive empirical study on the FSA, this study aims to understand the prevalence and covariates of older parents' perceived need and actual use of this agreement. DESIGN AND METHODS Using logistic regression analyses, t-tests an...

متن کامل

1 Environmental Risk Evaluation Criteria

This paper describes an approach to incorporating environmental risk evaluation criteria within IMO’s guidelines for Formal Safety Assessment (FSA). Such criteria are currently absent from FSA, and the discussion to include them has just started. Said criteria are relevant for evaluating on a cost-benefit basis Risk Control Options (RCOs) for reducing oil spill pollution risk. Oil pollution may...

متن کامل

Co-regularization Based Analysis of Feature Sharing Algorithms

A common approach in domain adaptation (DA) [1] and multitask learning (MTL) [2] is to create an expanded feature representation by sharing features across domains (in DA) or across tasks (in MTL) and then learning a classifier over this expanded feature set. In this paper, we refer to such techniques as feature sharing algorithms (FSA). One such FSA is EASYADAPT [1], which takes each feature i...

متن کامل

FSA/KB: an Object-Oriented Knowledge Base System

FSA/KB is an Object-Oriented Knowledge Base System of the Software Development Environment FSA (Fudan Software Automation). It manages various kinds of knowledge in an uniform Object-Oriented fashion. The knowledge it manages includes: descriptions of all FSA components, programming knowledge in rule form, user domain knowledge and products in the process of program transformation (eg., speciic...

متن کامل

Dynamic and Quantitative Method of Analyzing Service Consistency Evolution Based on Extended Hierarchical Finite State Automata

This paper is concerned with the dynamic evolution analysis and quantitative measurement of primary factors that cause service inconsistency in service-oriented distributed simulation applications (SODSA). Traditional methods are mostly qualitative and empirical, and they do not consider the dynamic disturbances among factors in service's evolution behaviors such as producing, publishing, calli...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of Systems and Software

دوره 85  شماره 

صفحات  -

تاریخ انتشار 2012